Labeling Stress in Continuous Mandarin Speech Perceptually
Abstract
This paper introduces two independent experiments in which the perceived degree of prominence and the stress type of syllables are labeled in a speech corpus containing 300 utterances. The prominence degrees obtained in the first experiment provide convincing evidence for distinguishing semantic stress from rhythmic stress in the second experiment. The final results show that (1) semantic stress is more prominent than rhythmic stress; (2) rhythmic stress tends to fall on the last syllable of the last prosodic word (or foot) in a semantic unit; (3) the location of semantic stress is difficult to predict from the prosodic structure of a sentence.
Similar resources
Syllabic Intensity Variations as Quantification of Speech Rhythm: Evidence from Both L1 and L2
In this study, three intensity metrics (ΔS.dB, VarcoS.dB and nPVI.dB) devised on the basis of the well-known durational metrics were tested on both L1 (English and Mandarin) and L2 (L2 English). The results suggested that they were effective in distinguishing “stress-timed” English from perceptually “syllable-timed” Mandarin and L2 English (by Mandarin speakers). These metrics break the impasse...
Using prosody to improve Mandarin automatic speech recognition
This paper discusses how to model and train a Mandarin prosody-dependent acoustic model and how to decode input speech with a prosody-dependent speech recognition system. We use automatic prosody labeling methods to annotate syllable prosodic break type and stress type on a continuous speech corpus, and utilize our proposed methods to train prosody dependent tonal sy...
From English pitch accent detection to Mandarin stress detection, where is the difference?
Although English pitch accent detection has been studied extensively, relatively few works explore Mandarin stress detection. Moreover, Mandarin stress detection and English pitch accent detection have not been compared and analyzed as counterpart tasks. In this paper, we discuss Mandarin stress detection and compare it with English pitch accent detection. The c...
Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis
Modeling Mandarin tones is one of the most important issues in speech synthesis. However, established knowledge is mainly focused on the “production” aspect. In this paper, we first characterized relative pitch levels of tones. Next, two perceptual experiments were designed to investigate “perceptual” relevance of pitch levels and shapes in Mandarin. Results showed that relative pitch levels of...
Automatic prosodic break labeling for Mandarin Chinese speech data
For corpus-based speech synthesis, large quantities of labeled speech are required. Manually labeling speech data is quite labor-intensive, so automatic speech labeling is highly desirable. Prosodic break detection is one of the tasks in automatic speech labeling. In this paper, we propose an automatic break detection algorithm for Mandarin Chinese speech. In this approach, we use energy ...